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Background Human ageing is a complex, multifactorial process and eariy devel- 
opmental factors affect health outcomes in old age. 

Methods Metabolomic profiling on fasting blood was carried out in 6055 
individuals from the UK. Stepwise regression was performed to 
identify a panel of independent metabolites which could be used 
as a surrogate for age. We also investigated the association with 
birthweight overall and within identical discordant twins and with 
genome-wide methylation levels. 

Results We identified a panel of 22 metabolites which combined are strongly 
correlated with age (R'^ = 59%) and with age-related clinical traits in- 
dependently of age. One particular metabolite, C-glycosyl tryptophan 
(C-glyTrp), correlated strongly with age (beta = 0.03, SE = 0.001, 
P = 7.0x 10"^") and lung function (FEVj beta = -0.04, SE = 0.008, 
P= 1.8 X 10~® adjusted for age and confounders) and was replicated 
in an independent population {n = 887). C-glyTrp was also associated 
with bone mineral density (beta = -0.01, SE = 0.002, P= 1.9 x 10"^) 
and birthweight (beta = -0.06, SE = 0.01, P = 2.5 x lO"*^). The differ- 
ence in C-glyTrp levels explained 9.4% of the variance in the difference 
in birthweight between monozygotic twins. An epigenome-wide as- 
sociation study in 172 individuals identified three CpG-sites, asso- 
ciated with levels of C-glyTrp (P < 2 x 10^*^). We replicated one CpG 
site in the promoter of the WDR85 gene in an independent sample of 



nil 



1112 INTERNATIONAL JOURNAL OF EPIDEMIOLOGY 



350 individuals (beta=-0.20, SE = 0.04, P = 2.9 x 10"*^). WD R8 5 is a 
regulator of translation elongation factor 2, essential for protein syn- 
thesis in eukaryotes. 

Conclusions Our data illustrate how metabolomic profiling linked with epigen- 
etic studies can identify some key molecular mechanisms poten- 
tially determined in early development that produce long-term 
physiological changes influencing human health and ageing. 

Keywords Ageing, metabolomics, epigenetics, twin studies, developmental ori- 
gins of health and disease, birthweight 



Introduction 

Human ageing is a multifactorial process influenced by 
genetic, lifestyle and environmental factors. Genetic 
variation explains only ~20-25% of the variability of 
human survival to the mid 80s,''~ implying the exist- 
ence of molecular changes over time which must relate 
to environmental, epigenetic and lifestyle factors. 

Several markers for the ageing process, such as 
telomere length and cuxulating levels of dehydroe- 
piandrosterone sulphate (DHEAS), have been identi- 
fied previously.' '^ Peripheral blood leukocyte telomere 
length has been shown to be a systemic marker for 
biological ageing.^ DHEAS is a major but poorly 
understood circulating steroid in human blood, 
whose levels are known to strongly correlate with age. 

The biochemistry of ageing is complex, with 
biologically significant changes occurring in many dif- 
ferent types of molecules.'' Metabolomics is a novel 
technology which aims to profile all low-molecular- 
weight metabolites that are present in biological 
samples which can investigate the molecular changes 
seen with ageing. Recently, a study on ageing in 2162 
normal individuals with a wide age range (32-81 
years) using a panel of 163 metabolites showed that 
metabolic profiles are strongly correlated with age.'' 
A small study of 269 individuals using a broader 
metabolomic platform also reported correlations with 
age for 51 metabolites (P<0.05 and false discovery 
rate Q < 0.15).^ 

In this study we assess, using a non-targeted meta- 
bolomic platform, the extent to which metabolomic 
profiles are correlated with chronological age and 
ageing-related traits in a large twin population 
{n — 6055). We further examine the role of specific 
metabolites on a likely developmental determinant 
of healthy ageing (i.e. birthweight) and investigate 
the potential pathways involved by using epigen- 
ome-wide association study (EWAS) data. 



Methods 

A flow chart of the study rationale is presented in 
Supplementary Figure 1, available as Supplementary 
data at IJE online. 



Discovery cohort 

Study subjects were twins enrolled in the TwinsUK 
registry, a national register of adult twins.' In this 
study we analysed data from 6055 twins with meta- 
bolomic profiling available. 

The replication cohort 

consisted of individuals from the follow-up study 
KORA F4 (Cooperative Health Research in the 
Region of Augsburg ) drawn from the general popula- 
tion of the region of Augsburg, Germany.'" In total, 
887 individuals with fasting serum metabolomic 
profiles available using the Metabolon platform and 
measures of FEVi were analysed. 

Phenotype definition 

Data were included on body mass index (BMI, body 
weight in kilograms divided by height in square 
metres), forced expiratory volume in 1 s (FEVi), 
forced expiratory vital capacity (FVC), hip bone mineral 
density (BMD), systolic and diastolic blood pressure 
(SDP and DBF, total and HDL cholesterol, albumin, 
leukocyte telomere length and dehydroepiandrosterone 
sulfate (DHEAS). For details on phenotype measure- 
ments see Supplementary Appendix I, available as 
Supplementary data at IJE online. 

Metabolomics measurements 

Non-targeted mass spec-based metabolomic profiling 
was performed on 1052 fasting serum samples and 
5003 fasting plasma samples from participants in 
the TwinsUK study, using the Metabolon platform, 
as described previously.''''^ Full details and quality 
control are included in the supplementary data as 
Supplementary Texts 2 and 3, available at IJE 
online. In all, 280 known metabolites were measured 
in the 1052 serum samples, and 281 known and 175 
unknown metabolites were measured in the 5003 
plasma samples. 

Statistical analysis 

Statistical analysis was carried out using Stata 
version II. 
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Step 1. Identify metabolites associated with 
chronological age 

For each metabolite, we calculated the residuals by 
running linear regression adjusting for sex, BMI and 
batch effect. Linear regressions for each metabolite 
residuals accounting for familial relatedness using 
random intercept linear regression were carried out: 

E(Xij) =/Jo + Ajageij + fj+£ij (1) 

where Xy is the metabolite residual of twin j from pair 
i; Q is the family- specific error component which 
represents the omitted family characteristics or 
unobserved heterogeneity. We accounted for multiple 
testing using Bonferroni correction (P= 0.05/280 
metabolite residuals). 

Step 2. Identify a metabolite panel of independent 
metabolites associated with chronological age 
Although the hypothesis underlying our work is that 
metabolite levels change with age, in order to inves- 
tigate which compounds are correlated independently 
of each other with age we carried out a stepwise 
linear regression, with age as the dependent variable. 
This was performed on all the metabolite residuals 
identified in Step 1, to look for a panel of indepen- 
dent metabolites which could be used as a surrogate 
for age. We use as cutoff being removed from the 
model 0.0000001. Adjusting for serum/plasma did 
not change the results. 

Step 3. Test for association with mortality 

A Cox regression model was used to estimate the 
proportional hazards ratio for mortality as a function 
of the linear combination of 22 metabolites. The vari- 
able derived in Figure 1 was used as the independent 
variable and death was the outcome variable (mean 
follow-up time 7.33 years, SD 4.46). 

Step 4. Assess whether the metabolite panel associated 
with chronological age is also associated with known 
age-related traits 

We tested for association with age-related traits: 
telomere length, SEP, DBF, FEVi, FVC, HMD, 
DHEAS, total and HDL cholesterol and albumin, com- 
paring the model: 

E(ageing — relatedtraitij) = /Sq + /^iChronological age^ 

+ + H 

(2) 

with the model: 

E (ageing — relatedtraity) 
= Pq + /JjChronological agej, 

(3) 

+ yjmetabolitel + .... + 3jmetabolite22 

+ + 

where is the family- specific error component. 
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Figure 1 Metabolite profile measures and age. 
The metabolite profile measure was calculated for each 
study participant using the coefficients from the stepwise 
regression on age of the 22 metabolites in Table 1. 
The mean and standard error of this variable was then 
computed for 5 years of age intervals in the study sample 



We calculated the coefficient of determination 
and the Akaike information criterion (AIC) to com- 
pare the proportion of variance explained by models 
(2) and (3) and to assess which of the models better 
fitted the data. 

Step 5. Replication in the KORA sample of C-glyTrp 

Linear regressions between C-glyTrp and age and 
FEVi were performed. Both analyses were adjusted 
for sex and BMI. Linear regression for FEVi was 
also adjusted for age and height. Meta-analyses of 
the combined discovery and validation results were 
conducted using inverse-variance fixed-effects meta- 
analysis. 

Step 6. Heritability of C-glyTrp in TwinsUK 

We estimated heritability using structural equation 
modelling to separate the observed phenotypic var- 
iance into three latent sources of variation: additive 
genetic variance (A), shared/common environmental 
variance (C), and non- shared/unique environmental 
variance (E). 

Step 7. Discordant twin analysis 

We selected 86 monozygotic twins with a minimum 
difference in btrthweight of 750 g (which corresponds 
to 2SD of the average difference between twins-mean 
difference at 1060 g, SD difference 320 g). A linear 
regression was performed on the difference in circu- 
lating levels of C-glyTrp and the difference in 
birthweight. 

Step 8. Epigenetic study of C-glyTrp in TwinsUK 

DNA methylation levels were obtained using the 27k 
lUumina CpG methylation probe array in 172 female 
twins aged 32 to 80 years, randomly selected from the 
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discovery cohort. QC measures were applied, as pre- 
viously described'' and 24 641 autosomal probes 
passed quality control. We tested for association 
between whole-blood DNA methylation patterns and 
C-glyTrp, adjusting for age, sex, BMI, metabolomic 
batch, methylation chip, sample position on methyla- 
tion chip and family relatedness. Adjusting for zygos- 
ity did not change the results. 

We followed up the association between C-glyTrp 
metabolite levels with DNA methylation levels at 
three probes in an independent sample of 350 indivi- 
duals from the TwinsUK cohort. DNA methylation 
levels in the foUow-up sample were obtained using 
the lUumina Infinium 450k array.'"' We meta-analysed 
results across the discovery (n=n2) and replication 
{n — 350) samples using inverse-variance fixed-effect 
meta-analysis. Probes in both the discovery and replica- 
tion samples were standardized to have mean zero and 
variance 1. 



Results 

A total of 6055 individuals aged 17-85 years from the 
TwinsUK cohort were included in the analysis of 280 
blood metabolites. The demographic characteristics of 
the study population are presented in Supplementary 
Table SI (available as Supplementary data at IJE 
online). 

Age was found to correlate with 165 metabolites 
after accounting for multiple testing (P < 1.8 x 10""*), 
famUy relatedness, sex and body mass index 
(BMI) (see Supplementary Table S2, available as 
Supplementary data at IJE online). The majority of 
these metabolites were related to lipid pathways 
(73) and amino acid pathways (49). There were also 
1 1 carbohydrates, 9 xenobiotics, 7 nucleotides, 6 inter- 
mediates in the energy pathway, 6 cofactors and vita- 
mins and 4 peptides. 

After a stepwise linear regression including these 
165 metabolites, we identified a panel of 22 indepen- 
dent metabolites associated with age (Table 1), 
achieving an R^=59%. The majority of metabolites 
were lipids (nine) and amino acids (seven) plus two 
intermediates in the energy pathway, two xenobiotics, 
one carbohydrate and one nucleotide. Many of the 
identified correlations between age and compounds 
have been previously described in the literature such 
as that with steroids,'^ creatinine''^ and citruUine.'^ 
This serves as a proof of principle for the other meta- 
bolites identified here. 

For each study participant we then added the 22 
metabolites using the regression coefficient, and cre- 
ated a metabolite-derived age variable. Plotting this 
variable in 5 -year age groups shows a clear linear 
relationship with age (Figure 1). The value of this 
combination of 22 metabolites as surrogate measure 
of chronological age is confirmed by testing their rela- 
tionship to mortality using data on 188 death events 
over an average follow-up time of 7.4 years. The 



variable created in Figure 1 results in a Cox propor- 
tional hazards ratio for death HR= 1.08 (95% CI 
1.06-1.09, P< I X lO"^*") per 'metabolic age' year. An 
important question is whether these metabolites con- 
tribute additional information to existing age-related 
traits. We ran linear regressions on age and the 22 
independently associated metabolite residuals, adjust- 
ing for each of them in turn (see Supplementary 
Tables S3 and S4, available as Supplementary data 
at IJE online). All metabolites remained associated 
(P < I X 10"'') with age when adjusting for telomere 
length and all but two (both steroids) when adjusting 
for DHEAS. 

Given the strong association with age, we hypothe- 
sized that these age-correlated 22 metabolites may be 
predictors of age-related traits after adjustment for 
chronological age. We considered the following 
traits: telomere length, systolic and diastolic blood 
pressure, two measures of lung function (FEVi 
and FVC, HMD, DHEAS, total and HDL cholesterol 
and albumin, all of which have been previously 
reported to be strongly associated with age."*""' 

These 22 metabolites are not, after adjusting for age, 
all equally associated with the various ageing related 
traits (Supplementary Tables 5-10, available as 
Supplementary data at IJE online), but adding all 
22 metabolites to the model E [(ageing-related 
traitjj ) = Po + PiChronological agcy + + Sy] increased 
the proportion of explained variance for each 
ageing-related trait. In particular, it added 2% to the 
total variance explained for FEVi and FVC, 2.5% for 
BMD, 4% for both SEP and DBP and for albumin, 5% 
for telomere length, 26.6% for DHEAS, 23.7% for HDL 
cholesterol and 46.33% for total cholesterol. The 
model including the 22 metabolites had in all cases 
the best fit (a lower AIC), indicating that the panel 
reflects the process of biological ageing after adjust- 
ment for chronological age. 

From the main analysis (Table I), the association with 
the level of C-glyTrp is of particular interest as it has not 
previously been reported to be an age-related metabolite 
and may point to novel molecular pathways involved in 
ageing. On its own C-glyTrp is correlated with age 
(beta = 0.03, SE = 0.001, P=7.0 x 10"'"), and age- 
related traits such as lung function (FEVi 
beta = -0.04, SE = 0.008, P= 1.8x10"** adjusted for 
age and confounders) and BMD (beta =—0.01, 
SE = 0.002, P= 1.9 x 10"*^). Moreover it is strongly 
and independently correlated with chronological age 
after adjusting for the presence of the other 21 meta- 
boHtes (beta = 2.47, SE = 0.24, P= 1.3 x 10"^') and 
with FEVi (beta = -0.05, SE = 0.01, P=2.9xl0"'), 
BMD (beta=-0.02, SE = 0.003, P = 4.8xl0"') 
(Table I), albumin (beta=-0.39, SE = 0.09, P=1.6x 
10"') (Supplementary Table S5, available as 
Supplementary data at IJE online), total cholesterol 
(beta=-0.11, SE = 0.03, P=2.7x 10"') and HDL cho- 
lesterol (beta=-0.06, SE = 0.01, P=7.8xl0"') 
(Supplementary Table S6, available as Supplementary 
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Table 2 Meta-analysis results for C-glyTrp with clironological age and FEVi; beta, SE and P-values are reported also for the 
discovery and replication cohorts. All analyses are adjusted for sex, BMI, batch effect and family relatedness. FEVi is also 
adjusted for chronological age and height" 



Phenotype 




Discovery 




Replication 




Fixed effect 


beta 


SE P 


beta 


SE P 


beta 


SE P 


Chronological age 


0.03 


0.001 7.0x10"'" 


0.004 


0.001 4.14x10"** 


0.012 


0.001 2.29 x 10"'°' 


FEVi 


-0.04 


0.008 1.80 x 10"*' 


-0.037 


0.019 4.87x 10"^ 


-0.04 


O.OI 2.43 X 10"'' 



"The betas, SEs and P-values are not adjusted for the presence of other metabolites. 



data at IJE online), FVC (beta =-0.04, SE = 0.01, 
P= 6.5x10"') (Supplementary Table S7, available as 
Supplementary data at IJE online), SBP (beta =—0.74, 
SE = 0.39, P=5.50xlO"^) and DBF (beta=-0.86, 
SE = 0.27, P= 1.4x10"') (Supplementary Table S8, 
available as Supplementary data at IJE online). 

As this metabolite is potentially a novel marker of 
ageing, we replicated the association in an independent 
population (KORA« = 887, Germany). Supplementary 
Table Sll (available as Supplementary data at IJE 
online) shows the demographic characteristics of the 
replication cohort. C-glyTrp is associated with chrono- 
logical age in KORA (beta = 0.004, SE = 0.001, 
P = 4.14x 10"*) and it is associated on meta-analysis 
(beta = 0.012, SE = 0.001, P = 2.29 x 10"""). It is also 
associated with FEVi in the KORA cohort 
(beta = -0.037, SE = 0.019, P = 4.87x 10"^) and asso- 
ciated on meta-analysis (fixed effects: beta = —0.04, 
SE = 0.01, P = 2.43 X 10"'') (see Table 2). 

Given its consistent association in two independent 
cohorts, we further explored its biology. 

Serum levels of 2-(a-mannopyranosyl)-L-tryptophan 
have been shown to be a more accurate measure of 
renal function than serum creatinine concentration, 
and renal function can be compared in subjects inde- 
pendently of age and muscle mass when 2-(a-manno- 
pyranosyl)-L-tryptophan concentration is measured. 
In the discovery cohort we find that levels of C-glyTrp 
are correlated with those of serum creatinine (squared 
correlation coefficient R^ = 0.I7, P< 0.0001). 

Tryptophan metabolism has already been indicated 
as a pathway affected by ageing in a metabolomic 
study of murine livers,^' but the association we see 
is strongest with glycosylated tryptophan, C-glyTrp. 

We assessed possible genetic variants associated 
with levels of this metabolite by looking at previously 
published genome-wide association scan data from 
both the KORA and the TwinsUK cohorts.^'' 
However, we failed to find any genome-wide signifi- 
cant genetic associations with this metabolite^'' and, 
despite reasonable statistical power, we did not find 
any association with chaperone proteins or glycosyla- 
tion-related protein encoding genes which would be 
the obvious genetic determinants. 

Taking advantage of the twin nature of our data we 
ran a heritability analysis^' (I3I9 MZ pairs, 1256 DZ 
pairs) and found that the levels of this metabolite 
have a heritability (h^) of 28%, meaning that 72% of 



the variance in its levels is not defined by a common 
genetic component. Thus, levels of this marker could 
be to a large extent environmentally or epigenetically 
mediated. 

It has been postulated that the health outcomes 
during ageing can be determined during early devel- 
opment.^^ The epidemiological observations that low 
birthweight is associated with increased rates of cor- 
onary heart disease, stroke, type 2 diabetes adiposity, 
the metabolic syndrome and osteoporosis in adult 
life"'' have been extensively replicated. 

Vv'e hypothesized that C-glyTrp might be related also 
to early development. This metabolite is also strongly 
associated with lower weight at birth in our twin 
population (beta = -0.06, SE = O.OI, P = 2.5 x lO"**), 
and birthweight explained 0.88% (R^ = 0.0088) of 
the variance in adult levels of C-glyTrp, suggesting 
that levels of this metabolite may reflect reduced 
growth and/or accelerated ageing in adult life. 

In twins there is considerable variation and on aver- 
age a birthweight difference of ~290 g^'' within a 
pair. Interestingly, having selected 85 monozygotic 
twins (i.e. genetically identical) highly discordant for 
weight at birth (750 g, 2SD of the average difference 
between twins), we find a correlation (R^ = 0.094, 
P < 0.004) between the difference in the levels of 
C-glyTrp (adjusted for confounders) and the differ- 
ence in the weight at birth between twins (Figure 
2). These data give further support to a non-genetic 
contribution to the levels of C-glyTrp which appear to 
be influenced by early development. 

Given the low heritability with this marker and the 
association of C-glyTrp with birthweight differences, 
we hypothesized that it may be epigenetically 
mediated. We compared metabolite levels at 
C-glyTrp with genome-wide DNA methylation profiles 
from the lUumina HumanMethylation27 DNA 
Analysis BeadChip assay in 172 individuals from the 
discovery cohort." The analyses were adjusted for 
age, sex, BMI, metabolomic batch, methylation chip, 
sample position on methylation chip and family 
relatedness. We found 3CpG-sites (cgI2757143, 
cg2036796I, cg25999867) at which DNA methylation 
levels were associated with levels of C-glyTrp with 
P< 2 X 10"" (see Table 3). 

We followed up the association between DNA 
methylation levels at the three probes with C-glyTrp 
metabolite levels in a replication sample of 350 
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additional individuals from the TwinsUK cohort. In 
two of the three probes the direction of association 
in both the discovery and the replication sets was the 
same, and for one of the probes (cgl2757143) the 

A 4n 

p<0.004 
3 - r2=0.094 



2 




-2 ■ 



A weight at birth (kg) 



B 2.5 1 




weight at birth (kg) 



Figure 2 (A) Correlation between the difference in levels 
of C-glyTrp and the difference in birthweight in 85 mono- 
zygotic twins discordant for birthweight (minimum differ- 
ence 750 g) (B) Relationship between C-glyTrp levels and 
weight at birth in the 20 most discordant monozygotic twin 
pairs. The twin in the pair with the highest birthweight is 
shown in dark circles, the one with lower birthweigh in 
white circles. Each twin pair is connected by a line 



significance level after adjustment for potential con- 
founders achieved Bonferroni significance 
(P< 1.3 X 10"^) (Table 3). 

Probe cgl2757143 maps to the promoter of the 
WDR85 gene in chromosome 9. WDR85 is a WD 
repeat-containing protein that plays a role in the 
first step of diphthamide biosynthesis.^*^ The transla- 
tion elongation factor 2 in eukaryotes (eEF-2) con- 
tains a unique post-translationally modified histidine 
residue, termed diphthamide, which serves as the 
only target for diphtheria toxin and Pseudomonas 
aeruginosa exotoxin A. 

Diphtheria toxin and exotoxin A inhibit host trans- 
lation through ADP ribosylation of eEF2 and cause 
cell death by inducing apoptosis.^' ADP ribosylation 
occurs on diphthamide, a post-translationally modi- 
fied histidine uniquely present in EF2 and conserved 
among all eukaryotes. Importantly the diphthamide 
modification on eEF2 has recently been shown to be 
essential for mRNA translation and embryonic devel- 
opment in mice.'" 

Therefore we find that epigenetic regulation at a 
gene implicated in the regulation of eEF2 which is 
key for cell cycle and embryonic development is 
associated with circulating levels of C-glyTrp. This is 
consistent with the strong association between 
C-glyTrp and birthweight and suggests that the rela- 
tionship between this metabolite and ageing may be 
related to apoptotic pathways. One of the other two 
probes whose association was not replicated mapped 
to the EDN2 gene which encodes for endothelin. 
Plasma levels of endothelin are increased in end- 
stage renal disease and have been implicated in 
renal inflammation and hypertension.^^ The second 
probe mapped to GLB1L3 encoding P-galactosidase- 
I-hke protein 3 which has been implicated in age-re- 
lated retinal degeneration.'^ 



Discussion 

In this study we report a panel of 22 biochemical 
metabolites which combined are strongly correlated 
with chronological age in humans (R^=59%). 
Importantly this panel can also account for some of 
the variation in ageing-related clinical traits measured 
as lung function and bone mineral density even after 
adjusting for chronological age. 



Table 3 Association between three methylation probes and C-glyTrp levels in combined discovery and replication samples 
(total « = 522) 











Discovery 


Replication 




Meta-analysis 


Probe 


Chr 


Position" 


Nearest gene 


beta 


SE 


beta 


SE 


beta 


SE 


P 


cgl2757143 


9 


140473704 


WDR85 


-0.25 


0.06 


-0.18 


0.04 


-0.20 


0.04 


2.9 X 10"*^ 


cg20367961 


1 


41950237 


EDN2 


-0.23 


0.06 


0.04 


0.05 


-0.08 


0.04 


4.7 X 10"^ 


cg25999867 


11 


134145940 


GLB1L3 


-0.21 


0.05 


-0.02 


0.03 


-0.06 


0.03 


3.0 X 10"^ 



^Base pair position on genome build 37. 
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Because of its novelty, we focused on one specific 
ageing-related metabolite, C-glyTrp. This metabolite 
has already been proposed as a particularly useful 
marker for normal renal function regardless of the 
age and muscle mass of the subjects. 

In our twin population, we found a strong relation- 
ship between C-glyTrp and birthweight, a well-known 
developmental determinant of health status in mid 
life and old age, suggesting that its levels may be 
influenced by early human development. However, 
other factors related to birthweight could also influ- 
ence levels of C-glyTrp and the data shown here do 
not demonstrate causality. The difference in birth- 
weight explained 9.4% of the variance in the differ- 
ence of C-glyTrp between genetically identical twins 
in a pair. The correlation between this metabolite and 
methylation at the promoter of WDR85, involved in 
regulating diphthamide synthesis, a key process for 
RNA translation, cell cycle and embryonic develop- 
ment, further suggests a role for this compound in 
both ageing and early development. 

We note some study limitations. First, there is a 
female predominance in our study sample (93% of 
the study sample are women). Second, birthweight 
is the only early life measure collected in the 
TwinsUK cohort and we could not test other early 
life measures which may have strengthened our 
data. Finally, we have been unable to access an inde- 
pendent population with combined birthweight and 
metabolomic data on which to replicate this result. 
Nonetheless, this result appears to be robust as the 
metabolite levels are not only strongly associated in 
independent individuals from the cohort, but are also 
replicated in within-pair differences in discordant 
identical twin pairs. 



Supplementary Data 

Supplementary data are available at IJE online. 



Funding 

This work was supported by: European Community's 
Seventh Framework programme (FP7/2007-2013) 
EurHEALTH Ageing HEALTH-F2-201 1-277849; Pfizer; 
the Wellcome Trust European Community's Seventh 
Framework Programme (FP7/2007-2013 to TwinsUK); 
ENGAGE project grant agreement (HEALTH-F4-2007- 
201413 to TwinsUK); the Department of Health via 
the National Institute for Health Research (NIHR) 
comprehensive Biomedical Research Centre award to 
Guy's & St Thomas NHS Foundation Trust in partner- 
ship with King's College London (to TwinsUK); ERC 
Advanced Principal Investigator award to T.D.S.; the 
Wellcome Trust (Grant 098051 to P.D. and L.T.); the 
German Federal Ministry of Education and Research 
(BMBF) and by the State of Bavaria to the Helmholtz 
Zentrum Miinchen-German Research Center for 
Environmental Health and Munich Center of Health 
Sciences (MC Health), Ludwig-Maximilians- 
Universitat, as part of LMU innovative and KORA; 
the Competence Network ASCONET, specifically the 
subnetwork COSYCONET, and the KORA-Age project 
funded by the German Federal Ministry of Education 
and Research to KORA. K.S. is supported by 
'Biomedical Research Program' funds at Weill 
Cornell Medical College in Qatar, a program funded 
by the Qatar Foundation. The statements made herein 
are solely the responsibility of the authors. 



Conclusions 

Observational and experimental evidence increasingly 
supports a relation between growth and development 
during foetal and infant life and health in later years, 
termed the developmental origins of health and dis- 
ease (DOHaD).^^ The data from the present study 
provide specific molecular insights for this hypothesis. 
The results illustrate how metabolomic profiling 
joined by epigenetic studies may help to identify 
novel molecular mechanisms implicated in subtle 
early life influences which produce long-term physio- 
logical changes that influence human health. 
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KEY MESSAGES 

• Using metabolomic profiling we have identified a specific metabolite, highly correlated with age and 
ageing traits, such as lung function and bone mineral density. 

• This metabolite, a glycosylated amino acid, is strongly associated with birthweight, a developmental 
determinant of healthy ageing. 
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• This glycosylated amino acid is also associated with methylation levels at a probe that maps to the 
promoter of a regulator of translation elongation factor 2. 

• The findings provide molecular mechanisms for the developmental origins of adult disease hypothesis 
and highlight the importance of epigenetic factors in this process. 
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